Goto

Collaborating Authors

 generalized block-diagonal structure pursuit


Generalized Block-Diagonal Structure Pursuit: Learning Soft Latent Task Assignment against Negative Transfer

Neural Information Processing Systems

In multi-task learning, a major challenge springs from a notorious issue known as negative transfer, which refers to the phenomenon that sharing the knowledge with dissimilar and hard tasks often results in a worsened performance. To circumvent this issue, we propose a novel multi-task learning method, which simultaneously learns latent task representations and a block-diagonal Latent Task Assignment Matrix (LTAM). Different from most of the previous work, pursuing the Block-Diagonal structure of LTAM (assigning latent tasks to output tasks) alleviates negative transfer via collaboratively grouping latent tasks and output tasks such that inter-group knowledge transfer and sharing is suppressed. This goal is challenging, since 1) our notion of Block-Diagonal Property extends the traditional notion for square matrices where the $i$-th column and the $i$-th column represents the same concept; 2) marginal constraints on rows and columns are also required for avoiding isolated latent/output tasks. Facing such challenges, we propose a novel regularizer by means of an equivalent spectral condition realizing this generalized block-diagonal property. Practically, we provide a relaxation scheme which improves the flexibility of the model. With the objective function given, we then propose an alternating optimization method, which not only tells how negative transfer is alleviated in our method but also reveals an interesting connection between our method and the optimal transport problem. Finally, the method is demonstrated on a simulation dataset, three real-world benchmark datasets and further applied to personalized attribute predictions.



Response for " Generalized Block-Diagonal Structure Pursuit: Learning Soft Latent Task 1 Assignment against Negative Transfer " 2 ID3136

Neural Information Processing Systems

Response for "Generalized Block-Diagonal Structure Pursuit: Learning Soft Latent T ask We thank all the reviewers for their valuable comments. We have fixed the typos pointed out by the reviewers. Is the framework limited only to linear models? Thm.3, the generalization ability will be promising if the loss is small (not necessarily only the optimal value) and the In this sense, a local critical point would be a good candidate solution. Are the constraints in the Obj included in the class H (L, S, null S, U)?


Reviews: Generalized Block-Diagonal Structure Pursuit: Learning Soft Latent Task Assignment against Negative Transfer

Neural Information Processing Systems

In this work, authors proposed a framework for multi-task learning, where there is assumed to be latent task space, and the learner simultaneously learn the latent task representation as well as the coefficients for these latent variables, namely the Latent Task Assignment Matrix (LTAM). Authors further imposed block-diagonal structure on the assignment matrix, and developed spectral regularizers for it. Authors then proposed a relaxed objective that can be optimized via a scheme similar to block coordinate descent. Authors also provided generalization learning guarantees as well as the structure recovery performance. Simulation experiments showed that the proposed algorithm can recover the true structure and provide improvement in prediction accuracy.


Generalized Block-Diagonal Structure Pursuit: Learning Soft Latent Task Assignment against Negative Transfer

Neural Information Processing Systems

In multi-task learning, a major challenge springs from a notorious issue known as negative transfer, which refers to the phenomenon that sharing the knowledge with dissimilar and hard tasks often results in a worsened performance. To circumvent this issue, we propose a novel multi-task learning method, which simultaneously learns latent task representations and a block-diagonal Latent Task Assignment Matrix (LTAM). Different from most of the previous work, pursuing the Block-Diagonal structure of LTAM (assigning latent tasks to output tasks) alleviates negative transfer via collaboratively grouping latent tasks and output tasks such that inter-group knowledge transfer and sharing is suppressed. This goal is challenging, since 1) our notion of Block-Diagonal Property extends the traditional notion for square matrices where the i -th column and the i -th column represents the same concept; 2) marginal constraints on rows and columns are also required for avoiding isolated latent/output tasks. Facing such challenges, we propose a novel regularizer by means of an equivalent spectral condition realizing this generalized block-diagonal property.


Generalized Block-Diagonal Structure Pursuit: Learning Soft Latent Task Assignment against Negative Transfer

Yang, Zhiyong, Xu, Qianqian, Jiang, Yangbangyan, Cao, Xiaochun, Huang, Qingming

Neural Information Processing Systems

In multi-task learning, a major challenge springs from a notorious issue known as negative transfer, which refers to the phenomenon that sharing the knowledge with dissimilar and hard tasks often results in a worsened performance. To circumvent this issue, we propose a novel multi-task learning method, which simultaneously learns latent task representations and a block-diagonal Latent Task Assignment Matrix (LTAM). Different from most of the previous work, pursuing the Block-Diagonal structure of LTAM (assigning latent tasks to output tasks) alleviates negative transfer via collaboratively grouping latent tasks and output tasks such that inter-group knowledge transfer and sharing is suppressed. This goal is challenging, since 1) our notion of Block-Diagonal Property extends the traditional notion for square matrices where the $i$-th column and the $i$-th column represents the same concept; 2) marginal constraints on rows and columns are also required for avoiding isolated latent/output tasks. Facing such challenges, we propose a novel regularizer by means of an equivalent spectral condition realizing this generalized block-diagonal property.